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RAW SEQUENlJiS LISTING 

PATENT APPLICATION US/0S/467,6b5 



DATE: 12/18/96 
TIME: 11:47:04 



INPVTSET: S14460.raw 
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This Raw Listing contains the General 



Information Section and up to the first 5 pnffi.^ 




SEQUENCE LISTING 

(1) General Information: 

(i) APPLICANT: Pietropaolo, Massimo and Eisenbarth, George S. 
(ii) TITLE OF INVENTION: Antigen Associated with Type I Diabetes Mellitus 
(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 State Street, suite 510 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109-1875 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/467,605 

(B) FILING DATE: 06 June 1995 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/788,118 

(B) FILING DATE: 01 November 1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Silveri, Jean M. 

(B) REGISTRATION NUMBER: 39,030 

(C) REFERENCE/DOCKET NUMBER: IMI-OOICPDV 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)227-5941 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 



PAGF: 2 RAW SEQUENCE LISTING DATE: 12/18/96 

: PATENT APPLICATION US/d8/467,605 TIME: li:47:07^ 

INPUT SET: S14460.raw 

4.1 (A) LENGTH: 1785 base pairs 

48 (B) TYPE: nucleic acid 

49 (C) STRANDEDNESS: single 

50 (D) TOPOLOGY: linear 
51 

52 (ii) MOLECULE TYPE: cDNA 

53 

54 

55 (ix) FEATURE: 

56 (A) NAME/KEY: CDS 

57 (B) LOCATION: 179.. 1628 
58 

59 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

61 

62 CGGGCGGGGG ATACCCCAGG AGATGGGGGT CGAGGAGAGA CCCCGGGGAG TAGAGAGAGA 60 
63 

64 GAAACTCACT CCCCGAGTCC CCGACCCTCC CCAAGCAAGG TTATAATATA ACTTATCCTC 120 
65 

66 TCATGCTTTT TTCCTGCCCC TTCTCCCCAA ATCATCAACA ATAGAAGAAG AAGAAAAC 178 
67 

68 ATG TCA GGA CAC AAA TGC AGT TAT CCC TGG GAC TTA CAG GAT CGA TAT 226 

69 Met Ser Gly His Lys Cys Ser Tyr Pro Trp Asp Leu Gin Asp Arg Tyr 

70 1 5 10 15 
71 

72 GCT CAA GAT AAG TCA GTT GTA AAT AAG ATG CAA CAG AG A TAT TGG GAG 274 

73 Ala Gin Asp Lys Ser Val Val Asn Lys Met Gin Gin Arg Tyr Trp Glu 

74 20 25 30 
75 

76 ACG AAG CAG GCC TTT ATT AAA GCC ACA GGG AAG AAG GAA GAT GAA CAT 322 

77 Thr Lys Gin Ala Phe lie Lys Ala Thr Gly Lys Lys Glu Asp Glu His 

78 35 40 45 
79 

80 GTT GTT GCC TCT GAC GCG GAC CTG GAT GCC AAG CTA GAG CTG TTT CAT 370 

81 Val Val Ala Ser Asp Ala Asp Leu Asp Ala Lys Leu Glu Leu Phe His 

82 50 55 60 
83 

84 TCA ATT CAG AGA ACC TGT CTG GAC TTA TCG AAA GCA ATT GTA CTC TAT 418 

85 Ser lie Gin Arg Thr Cys Leu Asp Leu Ser Lys Ala lie Val Leu Tyr 

86 65 70 75 80 
87 

88 CAA CAG AGG ATA TGT TTC TTG TCT CAA GAA GAA AAC GAA CTG GGA AAA 466 

89 Gin Gin Arg lie Cys Phe Leu Ser Gin Glu Glu Asn Glu Leu Gly Lys 

90 85 90 95 
91 

92 TTT CTT CGA TCC CAA GGT TTC CAA GAT AAA ACC AGA GCA GGA AAG ATG 514 

93 Phe Leu Arg Ser Gin Gly Phe Gin Asp Lys Thr Arg Ala Gly Lys Met 

94 100 105 110 
95 

96 ATG CAA GCG ACA GGA AAG GCC CTC TGC TTT TCT TCC CAG CAA AGG TTG 562 

97 Met Gin Ala Thr Gly Lys Ala Leu Cys Phe Ser Ser Gin Gin Arg Leu 

98 115 120 125 
99 
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RAW SEQUENCE LISTING 

PATENT APPLICATION ' mm/467, 605 



DATE: 12/18/96 
TIME: 11:47: ID 



INPUT SET: S14460.mw 



100 


GCC 


TTA 


CGA 


AAT 


CCT 


TTG 


TGT 


CGA 


TTT 


CAC 


CAA 


GAA 


GTG 


GAG 


ACT 


TTT 


610 


101 


Ala 


Leu 


Arg Asn 


Pro 


Leu 


Cvs 


Ara 


Phe 


His 


Gin 


Glu 


Val 


Glu 


Thr 


Phe 




102 




130 

X •J V 










135 










140 












103 




































104 


CGG 


CAT 


CGG 


GCC 


ATC 


TCA 


GAT 


ACT 


TGG 


CTG 


ACG 


GTG 


AAC 


CGC 


ATG 


GAA 


658 


105 


Ara 


His 


Arg Ala 


lie 


Ser 


ASD 


Thr 


Tro 


Leu 


Thr 


Val 


Asn Arg 


Met 


Glu 




106 

J. u u 


145 

Xt -J 










150 










155 










160 




107 




































108 


CAG 


TGC 




ACV9 


GAA 


TAT 


AGA 


GGA 


GCA 


CTA 


TTA 


TGG 


A 1 u 


a an 

AAo 


GAC 


GTG 


706 


109 


Gin 


Cys 


TV irn 
Ax ^ 


illX 


Glu 


Tvr 
X y X 


Arcr 


Glv 

oxy 


Ala 


Leu 


Leu 


Tro 

X x^ 




Xjy o 


Asp 


Val 




X J. u 










16 5 
X D 3 










1 70 

X / V 










1 75 
X / ^ 






111 




































112 


TCT 


CAG 




U i J. 


GAT 


CCA 


GAC 


CTC 


TAG 


AAG 


CAA 


ATG 


wAu 


a An 

AAU 


TTC 


AGG 


754 


113 


Ser 


Gin 




Leu 


Asp 


Pro 


Asp 


Leu 


Tmt 
lyx 


Lys 


Gin 


Met 




Ltys 


Phe 


Axy 




114 








1 AO 
XOU 










1 A5 

X O «7 










1 QO 

X 7 U 








115 




































116 


AAG 


GTG 




AuA 


CAA 


GTG 


CGC 


CTT 


GCA 


AAA 


AAA 


AAC 


TTT 


uA^ 


AAA 


TTG 


802 


117 


Lys 


Val 


uXii 


illX 


Gin 


Val 


«xy 


Leu 


Ala 


Lys 


Lys 


Asn 


file 


Aop 


Lys 


Leu 




1 1 A 

J. X o 






1 95 










200 
^ u u 










205 










1 1 Q 

X ± 7 




































120 


AAG 


ATG 




V9 1 (j 


TGT 


CAA 


AAA 


GTG 


GAT 


CTT 


CTT 


GGA 




AuC 


AGA 


TGC 


850 


121 


Lys 


Me't 


Asp 




Cys 


Gin 


Lys 


Val 


Asp 


Leu 


Leu 


Glv 




OCX 


Ara 

rtx y 


Cys 




199 




2 10 

^ X\J 










215 

^ XiJ 










220 












123 




































124 


AAT 


CTC 




lux 


CAC 


ATG 


CTA 


GCA 


ACA 


TAC 


CAG 


ACC 


ar"p 
AU 1 




CTT 


CAT 


898 


125 




XiXS u 


Leu 


dex 


His 

nx o 




XjC U 


Ala 


X XAX 


xyx 


Gin 


Thr 


1 liX 




Leu 


His 




1 26 
X ^ o 


22R 
4b Z 3 










2 TO 










2 T 5 

^ J «j 










240 




127 




































128 


TTT 


TGG 




AAA 


ACT 


TCT 


CAC 


ACT 


ATG 


GCA 


GCC 


ATC 




uAu 


AGT 


TTC 


946 


-129 


Phe 




uXU 


Lys 


Thr 


Ser 


His 


Thr 


Met 


Ala 


Ala 


He 


XIX o 


m 11 


Ser 


Phe 




1 "^O 
X o u 










24 5 










2 50 










255 






131 




































132 


AAA 


GGT 


1 Al 


UAA 


CCA 


TAT 


GAA 


TTT 


ACT 


ACT 


TTA 


AAG 


aop 


TTa 

1 lA 


CAA 


GAC 


994 


133 


Lys 


Glv 


lyr 




Pro 


Tvr 
X yx 


Glu 


Phe 


Thr 


Thr 


Leu 


Lys 


iSXS X 


Leu 


Gin 


Asp 




X J 4 








260 










265 










270 








135 




































136 


CCT 


ATG 


AAA 


AAA 


TTA 


GTT 


GAG 


AAA 


GAA 


GAG 


AAG 


AAG 




ATP 
A X ^ 


AAC 


CAG 


1042 


137 


Pro 


Met 




Xiy s 


Leu 


Val 


Glu 


Lys 


Glu 


Glu 


Lys 


Lys 




Tie 

X xc 


Asn 


Gin 




1 '5 8 

X <J o 






275 




















285 










139 




































140 


CAG 


GAA 


AGT 


ACA 


GAT 


GCA 


GCC 


GTG 


CAG 


CAG 


CCG 


AGC 


CAA 


TTA 


ATT 


TCA 


1090 


141 


Gin 


Glu 


Ser 


Thr 


Asp 


Ala 


Ala 


Val 


Gin 


Gin 


Pro 


Ser 


Gin 


Leu 


lie 


Ser 




142 




290 










295 










300 












143 




































144 


TTA 


GAG 


GAA 


GAA 


AAC 


CAG 


CGC 


AAG 


GAA 


TCC 


TCT 


AGT 


TTT 


AAG 


ACT 


GAA 


1138 


145 


Leu 


Glu 


Glu 


Glu 


Asn 


Gin 


Arg 


Lys 


Glu 


Ser 


Ser 


Ser 


Phe 


Lys 


Thr 


Glu 




146 


305 










310 










315 










320 




147 




































148 


GAT 


GGA 


AAA 


AGT 


ATT 


TTA 


TCT 


GCC 


TTA 


GAC 


AAA 


GGC 


TCT 


ACA 


CAT 


ACT 


1186 


149 


Asp 


Gly 


Lys 


Ser 


He 


Leu 


Ser 


Ala 


Leu 


Asp 


Lys 


Gly 


Ser 


Thr 


His 


Thr 




150 










325 










330 










335 






151 




































152 


GCA 


TGC 


TCA 


GGA 


CCC 


ATA 


GAT 


GAA 


CTA 


TTA 


GAC 


ATG 


AAA 


TCT 


GAG 


GAA 


1234 



PAGE: 4 RAW SEQUENCE LISTING DATE: 12/18/96 

: ^ . TIME: 11:47:14 



INPUT SET: S14460.mw 



153 


Ala 


Cys 


Ser 


Glv 


Pro 


lie 


Asp 


Glu 


Leu 


Leu 


Asp 


Met 


Lys 


Ser 


Glu 


Glu 












•J 4 V 










345 










350 












































156 


GGT 


GCT 


TGC 


CTG 


GGA 


CCA 


GTG 


GCA 


GGG 


ACC 


CCG 


GAA 


CCT 


GAA 


GGT 


GCT 


1282 


157 


Glv 


Ala 


Cys 


Leu 


Glv 


Pro 


Val 


Ala 


Glv 


Thr 


Pro 


Glu 


Pro 


Glu 


Glv 


Ala 




1 Sfl 

X o 






355 










360 










365 










X «3 7 




































160 


GAC 


AAA 


GAT 


GAC 


CTG 


CTG 


CTG 


TTG 


ACT 


GAG 


ATC 


TTC 


AAT 


GCT 


TCC 


TCC 


1330 


1^1 

X D X 




ij y o 




A Qr> 






Leu 


Leu 


Ser 


Glu 


lie 


Phe 


Asn 


Ala 


Ser 


Ser 




1 CO 
X D Z 




J / U 










"57 K 










1ft n 
J o u 












X D J 




































X O 




(3AA 


GA<3 


(3GP 


GAG 




AGC 


AAA 


GAG 


TGG 


GCC 


GCT 


GTG 


TTT 


GGA 


GAC 


1378 


X O 3 




m ti 

uxu 


OX Li 


oxy 


Gl tl 
ux u 




•9CX 




Olu 


Trn 
1 xp 


Ala 


Ala 


Val 


Phe 


Glv 
vjxy 


Asp 




X D O 


J O 3 




















J ^ ij 










400 




X D / 




































X D O 






Vj 1 o 


AAn 


GAG 


PPA 


GTG 




ACT 


ATG 


GCC 


CTG 


GGA 


GAG 


CCA 


GAC 


1426 


X Q ? 




m n 


vox 




Gl 11 
uxu 


It X V 


Val 

V a X 


XT X V 


Thr 

X txx 




Ala 

AX a 


Leu 


Glv 
uxy 


Glu 


Pro 


Asp 




170 
X / u 




















AlO 

4 X u 










415 






X / X 




































179 

X f ^ 


V-» v_» 




GPP 
o v_» v_» 


PAG 


APA 


GGC 


TCA 


GGT 


TTC 


CTT 


CCT 


TCG 


CAG 


CTT 


TTA 


GAC 


1474 


1 7 ? 

X / J 


It X \J 


T \7C! 

Xiy a 


Ala 

nXCl 


n1 n 

V9XI1 


TVir 


Gl V 
uxy 


X 


Gl V 

ux jr 


Phe 






Ser 


Gin 


Leu 


Leu 


Asp 




1 7 >1 








4 ZU 










A 0 R 


















1 7 R 

X / 3 




































176 


CAA 


AAT 


ATG 


AAA 


GAC 


TTA 


CAG 


GCC 


TCG 


CTA 


CAA 


GAA 


CCT 


GCT 


AAG 


GCT 


1522 


1 77 


nl n 


Asn 




T 

Xiy o 




J_l c u 


Gl n 


Al a 
nXa 


^Ar 


XjC U 


Gl n 


Gl 11 
ux u 


Pro 


Ala 


Lys 


Ala 




1 7ft 

X f o 






*± J tj 










4 4 u 










445 










1 7 Q 




































1 AO 

X O Vi 




TP A 


riAP 


PTP 


ACT 


GCC 


TGG 


TTC 


AGC 


CTC 


TTC 


GCT 


GAC 


CTC 


GAC 


CCA 


1570 


1 ftl 

X O X 


nXd 


Ocx 


Asp) 


Leu 


Thr 


Ala 


Trp 


Phe 


Ser 


Leu 


Phe 


Ala 


Asp 


Leu 


Asp 


Pro 




1 ft9 
X o ^ 




3 U 










455 










460 












1 ft 

X O <J 




































1 ftA 

X O 4 


CTC 


TCA 


AAT 


CCT 


GAT 


GCT 


GTT 


GGG 


AAA 


ACC 


GAT 


AAA 


GAA 


CAC 


GAA 


TTG 


1618 


X o 


Leu 


Ser 


Asn 


Pro 


Asp 


Ala 


Val 


Gly 


Lys 


Thr 


Asp 


Lys 


Glu 


His 


Glu 


Leu 




186 


465 










470 










475 










480 




1 87 

X o / 




































188 


CTC 


AAT 


GCA 


TGA 


ATCTGTAC 


! CCTTCGGAGG 


GCACTCACAT GCCGCCCCCA 




1668 


1ft 9 

X o ? 


Leu 


Asn 


Ala 


* 




























190 




































191 




































192 


GCAGCTCCCC TGGGGGCTAG CAGAAGTATA AAGTGATCAG 


TATGCTGTTT TAATAATTAT 


1728 


193 




































194 


GTGCCATTTT AATAAAATGA AAGGGTCAAC 


! GGCCCTGTTA 


AAAAAAAAAA AAAAAAA 


1785 


195 




































1 

X 7 Q 




































197 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 2: 




















198 




































199 




(i) SEQUENCE 


CHARACTERISTICS : 


















200 








(A) 


LENGTH: 


483 


amino acids 














201 








(B) 


TYPE: amine 


» acid 


















202 








(D) 


TOPOLOGY: linear 


















203 




































204 




(ii) MOLECULE 


TYPE 


: protein 


















205 





































PAGE: 5 RAW. SEQUENCE LISTING DATE: .2/18/96 

- .'PATENT APPLICATION mm/467, 60S . TIME: 11:47:18 

INPUT SET: S14460.mw 

206 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

207 

208 Met Ser Gly His Lys Cys Ser Tyr Pro Trp Asp Leu Gin Asp Arg Tyr 

209 15 10 15 
210 

211 Ala Gin Asp Lys Ser Val Val Asn Lys Met Gin Gin Arg Tyr Trp Glu 

212 20 25 30 
213 

214 Thr Lys Gin Ala Phe lie Lys Ala Thr Gly Lys Lys Glu Asp Glu His 

215 35 40 45 
216 

217 Val Val Ala Ser Asp Ala Asp Leu Asp Ala Lys Leu Glu Leu Phe His 

218 50 55 60 
219 

220 Ser lie Gin Arg Thr Cys Leu Asp Leu Ser Lys Ala lie Val Leu Tyr 

221 65 70 75 80 
222 

223 Gin Gin Arg lie Cys Phe Leu Ser Gin Glu Glu Asn Glu Leu Gly Lys 

224 85 90 95 
225 

226 Phe Leu Arg Ser Gin Gly Phe Gin Asp Lys Thr Arg Ala Gly Lys Met 

227 100 105 110 
228 

229 Met Gin Ala Thr Gly Lys Ala Leu Cys Phe Ser Ser Gin Gin Arg Leu 

230 115 120 125 
231 

232 Ala Leu Arg Asn Pro Leu Cys Arg Phe His Gin Glu Val Glu Thr Phe 

233 130 135 140 
234 

235 Arg His Arg Ala lie Ser Asp Thr Trp Leu Thr Val Asn Arg Met Glu 

236 145 150 155 160 
237 

238 Gin Cys Arg Thr Glu Tyr Arg Gly Ala Leu Leu Trp Met Lys Asp Val 

239 165 170 175 
240 

241 Ser Gin Glu Leu Asp Pro Asp Leu Tyr Lys Gin Met Glu Lys Phe Arg 

242 180 185 190 
243 

244 Lys Val Gin Thr Gin Val Arg Leu Ala Lys Lys Asn Phe Asp Lys Leu 

245 195 200 205 
246 

247 Lys Met Asp Val Cys Gin Lys Val Asp Leu Leu Gly Ala Ser Arg Cys 

248 210 215 220 
249 

250 Asn Leu Leu Ser His Met Leu Ala Thr Tyr Gin Thr Thr Leu Leu His 

251 225 230 235 240 
252 

253 Phe Trp Glu Lys Thr Ser His Thr Met Ala Ala lie His Glu Ser Phe 

254 245 250 255 
255 

256 Lys Gly Tyr Gin Pro Tyr Glu Phe Thr Thr Leu Lys Ser Leu Gin Asp 

257 260 265 270 
258 



